HW-Flow-Fusion: Inter-Layer Scheduling for Convolutional Neural Network Accelerators with Dataflow Architectures

نویسندگان

چکیده

Energy and throughput efficient acceleration of convolutional neural networks (CNN) on devices with a strict power budget is achieved by leveraging different scheduling techniques to minimize data movement maximize reuse. Several dataflow mapping frameworks have been developed explore the optimal CNN layers reconfigurable accelerators. However, previous works usually optimize each layer singularly, without reuse between CNNs. In this work, we present an analytical model achieve searching for communication computation across layers. We call inter-layer framework HW-Flow-Fusion, as fused map-space multiple sharing available resources same accelerator, investigating constraints trade-offs execution workloads dependencies. propose memory-efficient model, tiling, resource partitioning strategies fuse recomputation. Compared standard single-layer scheduling, can reduce volume 51% 53% selected VGG16-E ResNet18 spatial array latency 39% 34% respectively, while also increasing ratio which improves memory bandwidth efficiency.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scheduling Dataflow Execution Across Multiple Accelerators

Dataflow execution engines such as MapReduce, DryadLINQ and PTask have enjoyed success because they simplify development for a class of important parallel applications. Expressing the computation as a dataflow graph allows the runtime, and not the programmer, to own problems such as synchronization, data movement and scheduling leveraging dynamic information to inform strategy and policy in a w...

متن کامل

Dataflow Architectures: Flexible Platforms for Neural Network Simulation

Dataflow architectures are general computation engines optimized for the execution of fme-grain parallel algorithms. Neural networks can be simulated on these systems with certain advantages. In this paper, we review dataflow architectures, examine neural network simulation performance on a new generation dataflow machine, compare that performance to other simulation alternatives, and discuss t...

متن کامل

Analysis of Deep Convolutional Neural Network Architectures

In computer vision many tasks are solved using machine learning. In the past few years, state of the art results in computer vision have been achieved using deep learning. Deeper machine learning architectures are better capable in handling complex recognition tasks, compared to previous more shallow models. Many architectures for computer vision make use of convolutional neural networks which ...

متن کامل

A Radon-based Convolutional Neural Network for Medical Image Retrieval

Image classification and retrieval systems have gained more attention because of easier access to high-tech medical imaging. However, the lack of availability of large-scaled balanced labelled data in medicine is still a challenge. Simplicity, practicality, efficiency, and effectiveness are the main targets in medical domain. To achieve these goals, Radon transformation, which is a well-known t...

متن کامل

An Overview of Convolutional Neural Network Architectures for Deep Learning

Since AlexNet was developed and applied to the ImageNet classi cation competition in 2012 [1], the quantity of research on convolutional networks for deep learning applications has increased remarkably. In 2015, the top 5 classi cation error was reduced to 3.57%, with Microsoft's Residual Network [2]. The previous top 5 classi cation error was 6.67%, achieved by GoogLeNet [3]. In recent years, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2022

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics11182933